Dataset statistics
| Number of variables | 15 |
|---|---|
| Number of observations | 5008 |
| Missing cells | 359 |
| Missing cells (%) | 0.5% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 450.1 KiB |
| Average record size in memory | 92.0 B |
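The percentage and per-record figures in this table follow from the raw counts. A minimal sketch of that arithmetic, assuming the missing-cell percentage is taken over all cells (variables × observations) and that 1 KiB is 1024 bytes; the variable names are illustrative only:

```python
# Values copied from the "Dataset statistics" table.
n_variables = 15
n_observations = 5008
missing_cells = 359
total_size_kib = 450.1

# Share of missing cells over the whole variables x observations grid.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Average bytes per record, assuming 1 KiB = 1024 bytes.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Missing cells: {missing_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```

Both results round to the values shown in the table, which suggests the rounding there is to one decimal place.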
Variable types
| Numeric | 9 |
|---|---|
| DateTime | 1 |
| Categorical | 5 |
Alerts
| Alert | Type |
|---|---|
| Ruta has a high cardinality: 4124 distinct values | High cardinality |
| OperadOR has a high cardinality: 2267 distinct values | High cardinality |
| ac_type has a high cardinality: 2468 distinct values | High cardinality |
| Registros has a high cardinality: 4700 distinct values | High cardinality |
| Resumen has a high cardinality: 4857 distinct values | High cardinality |
| Unnamed: 0 is highly overall correlated with Año_realializado | High correlation |
| Todos_abordo is highly overall correlated with Pasajeros_a_bordo and 3 other fields | High correlation |
| Pasajeros_a_bordo is highly overall correlated with Todos_abordo and 3 other fields | High correlation |
| Tripulacion_abordo is highly overall correlated with Todos_abordo and 3 other fields | High correlation |
| cantidad de fallecidos is highly overall correlated with Todos_abordo and 4 other fields | High correlation |
| Pasajeros_fallecidos is highly overall correlated with Todos_abordo and 2 other fields | High correlation |
| Tripulacionfallecida is highly overall correlated with Tripulacion_abordo and 1 other field | High correlation |
| Año_realializado is highly overall correlated with Unnamed: 0 | High correlation |
| Registros has 272 (5.4%) missing values | Missing |
| Resumen has 59 (1.2%) missing values | Missing |
| suelo is highly skewed (γ1 = 49.20313053) | Skewed |
| Unnamed: 0 is uniformly distributed | Uniform |
| Ruta is uniformly distributed | Uniform |
| Registros is uniformly distributed | Uniform |
| Resumen is uniformly distributed | Uniform |
| Unnamed: 0 has unique values | Unique |
| Pasajeros_a_bordo has 869 (17.4%) zeros | Zeros |
| cantidad de fallecidos has 76 (1.5%) zeros | Zeros |
| Pasajeros_fallecidos has 1040 (20.8%) zeros | Zeros |
| Tripulacionfallecida has 400 (8.0%) zeros | Zeros |
| suelo has 4716 (94.2%) zeros | Zeros |
Reproduction
| Analysis started | 2023-05-24 03:51:02.489508 |
|---|---|
| Analysis finished | 2023-05-24 03:52:33.080778 |
| Duration | 1 minute and 30.59 seconds |
| Software version | pandas-profiling v3.6.6 |
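The reported duration can be checked against the two timestamps. A small sketch using only the standard library, assuming both timestamps are in the same time zone:

```python
from datetime import datetime

# Timestamp layout used in the "Reproduction" table.
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2023-05-24 03:51:02.489508", FMT)
finished = datetime.strptime("2023-05-24 03:52:33.080778", FMT)

# Elapsed wall-clock time in seconds, then split into minutes and seconds.
elapsed = (finished - started).total_seconds()
minutes, seconds = divmod(elapsed, 60)

print(f"{int(minutes)} minute(s) and {seconds:.2f} seconds")
```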
Unnamed: 0
Real number (ℝ)
HIGH CORRELATION  UNIFORM  UNIQUE 
| Distinct | 5008 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2503.5 |
| Minimum | 0 |
| Maximum | 5007 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 250.35 |
| Q1 | 1251.75 |
| median | 2503.5 |
| Q3 | 3755.25 |
| 95-th percentile | 4756.65 |
| Maximum | 5007 |
| Range | 5007 |
| Interquartile range (IQR) | 2503.5 |
Descriptive statistics
| Standard deviation | 1445.8294 |
|---|---|
| Coefficient of variation (CV) | 0.57752323 |
| Kurtosis | -1.2 |
| Mean | 2503.5 |
| Median Absolute Deviation (MAD) | 1252 |
| Skewness | 0 |
| Sum | 12537528 |
| Variance | 2090422.7 |
| Monotonicity | Strictly increasing |
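These figures are what a plain row counter running from 0 to 5007 would produce, which suggests (but does not prove) that `Unnamed: 0` is a leftover index column rather than a measured variable. A minimal sketch of that check, assuming the report uses the sample (n − 1) variance:

```python
import statistics

# Hypothetical stand-in for the column: a row counter 0, 1, ..., 5007.
values = range(5008)

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance (n - 1 denominator)
stdev = statistics.stdev(values)
cv = stdev / mean

# Each value is smaller than its successor.
is_strictly_increasing = all(a < b for a, b in zip(values, values[1:]))

print(total, mean, round(variance, 1), round(stdev, 4), round(cv, 8))
print(is_strictly_increasing)
```

If the match holds on the real data, dropping the column before modelling is a reasonable option, since it carries no information beyond row order.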
Common Values
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 3336 | 1 | < 0.1% |
| 3343 | 1 | < 0.1% |
| 3342 | 1 | < 0.1% |
| 3341 | 1 | < 0.1% |
| 3340 | 1 | < 0.1% |
| 3339 | 1 | < 0.1% |
| 3338 | 1 | < 0.1% |
| 3337 | 1 | < 0.1% |
| 3335 | 1 | < 0.1% |
| Other values (4998) | 4998 |
Minimum 10 values
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 9 | 1 |
Maximum 10 values
| Value | Count | Frequency (%) |
| 5007 | 1 | |
| 5006 | 1 | |
| 5005 | 1 | |
| 5004 | 1 | |
| 5003 | 1 | |
| 5002 | 1 | |
| 5001 | 1 | |
| 5000 | 1 | |
| 4999 | 1 | |
| 4998 | 1 |
fecha
Date
| Distinct | 4577 |
|---|---|
| Distinct (%) | 91.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.2 KiB |
| Minimum | 1908-09-17 00:00:00 |
| Maximum | 2021-07-06 00:00:00 |
Ruta
Categorical
HIGH CARDINALITY  UNIFORM 
| Distinct | 4124 |
|---|---|
| Distinct (%) | 82.4% |
| Missing | 5 |
| Missing (%) | 0.1% |
| Memory size | 39.2 KiB |
| Value | Count |
|---|---|
| Moscow, Russia | 16 |
| Manila, Philippines | 15 |
| New York, New York | 14 |
| Sao Paulo, Brazil | 13 |
| Cairo, Egypt | 13 |
| Other values (4119) | 4932 |
Length
| Max length | 72 |
|---|---|
| Median length | 49 |
| Mean length | 20.812712 |
| Min length | 5 |
Characters and Unicode
| Total characters | 104126 |
|---|---|
| Distinct characters | 90 |
| Distinct categories | 9 |
| Distinct scripts | 2 |
| Distinct blocks | 2 |
Unique
| Unique | 3687 |
|---|---|
| Unique (%) | 73.7% |
Sample
| 1st row | Fort Myer, Virginia |
|---|---|
| 2nd row | Juvisy-sur-Orge, France |
| 3rd row | Atlantic City, New Jersey |
| 4th row | Victoria, British Columbia, Canada |
| 5th row | Over the North Sea |
Common Values
| Value | Count | Frequency (%) |
| Moscow, Russia | 16 | 0.3% |
| Manila, Philippines | 15 | 0.3% |
| New York, New York | 14 | 0.3% |
| Sao Paulo, Brazil | 13 | 0.3% |
| Cairo, Egypt | 13 | 0.3% |
| Bogota, Colombia | 12 | 0.2% |
| Rio de Janeiro, Brazil | 12 | 0.2% |
| Near Moscow, Russia | 11 | 0.2% |
| Chicago, Illinois | 11 | 0.2% |
| Tehran, Iran | 10 | 0.2% |
| Other values (4114) | 4876 |
Words
| Value | Count | Frequency (%) |
| near | 1350 | 9.2% |
| off | 350 | 2.4% |
| russia | 255 | 1.7% |
| new | 229 | 1.6% |
| brazil | 176 | 1.2% |
| colombia | 153 | 1.0% |
| canada | 131 | 0.9% |
| france | 127 | 0.9% |
| california | 117 | 0.8% |
| mexico | 113 | 0.8% |
| Other values (4153) | 11652 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 13037 | 12.5% |
| (space) | 9703 | 9.3% |
| e | 7073 | 6.8% |
| i | 6567 | 6.3% |
| n | 6545 | 6.3% |
| r | 6035 | 5.8% |
| o | 5367 | 5.2% |
| , | 5210 | 5.0% |
| l | 4000 | 3.8% |
| s | 3530 | 3.4% |
| Other values (80) | 37059 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 74113 | |
| Uppercase Letter | 14738 | 14.2% |
| Space Separator | 9704 | 9.3% |
| Other Punctuation | 5357 | 5.1% |
| Dash Punctuation | 105 | 0.1% |
| Decimal Number | 66 | 0.1% |
| Control | 21 | < 0.1% |
| Close Punctuation | 11 | < 0.1% |
| Open Punctuation | 11 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 13037 | |
| e | 7073 | |
| i | 6567 | |
| n | 6545 | |
| r | 6035 | 8.1% |
| o | 5367 | 7.2% |
| l | 4000 | 5.4% |
| s | 3530 | 4.8% |
| t | 3112 | 4.2% |
| u | 2756 | 3.7% |
| Other values (31) | 16091 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 2032 | |
| C | 1456 | 9.9% |
| S | 1145 | 7.8% |
| M | 999 | 6.8% |
| B | 952 | 6.5% |
| A | 920 | 6.2% |
| P | 787 | 5.3% |
| I | 720 | 4.9% |
| R | 652 | 4.4% |
| O | 588 | 4.0% |
| Other values (17) | 4487 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 24 | |
| 1 | 15 | |
| 2 | 9 | 13.6% |
| 5 | 8 | 12.1% |
| 8 | 3 | 4.5% |
| 7 | 2 | 3.0% |
| 3 | 2 | 3.0% |
| 9 | 2 | 3.0% |
| 6 | 1 | 1.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 5210 | |
| . | 115 | 2.1% |
| ' | 24 | 0.4% |
| / | 6 | 0.1% |
| & | 1 | < 0.1% |
| : | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 9703 | |
| (no-break space) | 1 | < 0.1% |
Control
| Value | Count | Frequency (%) |
| (control character) | 16 | |
| (control character) | 5 | 23.8% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 105 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 11 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 11 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 88851 | |
| Common | 15275 | 14.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 13037 | |
| e | 7073 | 8.0% |
| i | 6567 | 7.4% |
| n | 6545 | 7.4% |
| r | 6035 | 6.8% |
| o | 5367 | 6.0% |
| l | 4000 | 4.5% |
| s | 3530 | 4.0% |
| t | 3112 | 3.5% |
| u | 2756 | 3.1% |
| Other values (58) | 30829 |
Common
| Value | Count | Frequency (%) |
| (space) | 9703 | |
| , | 5210 | |
| . | 115 | 0.8% |
| - | 105 | 0.7% |
| 0 | 24 | 0.2% |
| ' | 24 | 0.2% |
| (control character) | 16 | 0.1% |
| 1 | 15 | 0.1% |
| ) | 11 | 0.1% |
| ( | 11 | 0.1% |
| Other values (12) | 41 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 104084 | |
| None | 42 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 13037 | 12.5% |
| (space) | 9703 | 9.3% |
| e | 7073 | 6.8% |
| i | 6567 | 6.3% |
| n | 6545 | 6.3% |
| r | 6035 | 5.8% |
| o | 5367 | 5.2% |
| , | 5210 | 5.0% |
| l | 4000 | 3.8% |
| s | 3530 | 3.4% |
| Other values (63) | 37017 |
None
| Value | Count | Frequency (%) |
| é | 14 | |
| ö | 5 | 11.9% |
| Ã | 4 | 9.5% |
| ó | 4 | 9.5% |
| á | 2 | 4.8% |
| ï | 2 | 4.8% |
| ô | 1 | 2.4% |
| è | 1 | 2.4% |
| Ã | 1 | 2.4% |
| ä | 1 | 2.4% |
| Other values (7) | 7 |
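Accented characters outside the ASCII block are easy to corrupt when a file is read with the wrong encoding, for example UTF-8 bytes decoded as Latin-1. A minimal sketch of how that corruption arises and how it can sometimes be reversed; `to_mojibake` and `repair_mojibake` are hypothetical helpers, and the sample strings are illustrative rather than taken from the dataset:

```python
def to_mojibake(text: str) -> str:
    """Simulate the damage: UTF-8 bytes wrongly decoded as Latin-1."""
    return text.encode("utf-8").decode("latin-1")


def repair_mojibake(text: str) -> str:
    """Try to undo the damage; return the input unchanged if it does not apply."""
    try:
        return text.encode("latin-1").decode("utf-8")
    except (UnicodeEncodeError, UnicodeDecodeError):
        # Text was not Latin-1-decoded UTF-8 (or is already correct).
        return text


damaged = to_mojibake("São Paulo, Brazil")
print(damaged)                        # the "ã" shows up as two characters
print(repair_mojibake(damaged))       # round-trips back to the original
print(repair_mojibake("Cairo, Egypt"))  # pure ASCII is left untouched
```

The repair only works while both bytes of each character survive; where one byte has been dropped, the original character cannot be recovered this way.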
OperadOR
Categorical
| Distinct | 2267 |
|---|---|
| Distinct (%) | 45.4% |
| Missing | 10 |
| Missing (%) | 0.2% |
| Memory size | 39.2 KiB |
| Value | Count |
|---|---|
| Aeroflot | 253 |
| Military - U.S. Air Force | 141 |
| Air France | 74 |
| Deutsche Lufthansa | 63 |
| United Air Lines | 44 |
| Other values (2262) | 4423 |
Length
| Max length | 65 |
|---|---|
| Median length | 47 |
| Mean length | 18.957583 |
| Min length | 3 |
Characters and Unicode
| Total characters | 94750 |
|---|---|
| Distinct characters | 87 |
| Distinct categories | 9 |
| Distinct scripts | 2 |
| Distinct blocks | 2 |
Unique
| Unique | 1734 |
|---|---|
| Unique (%) | 34.7% |
Sample
| 1st row | Military - U.S. Army |
|---|---|
| 2nd row | Military - U.S. Navy |
| 3rd row | Private |
| 4th row | Military - German Navy |
| 5th row | Military - German Navy |
Common Values
| Value | Count | Frequency (%) |
| Aeroflot | 253 | 5.1% |
| Military - U.S. Air Force | 141 | 2.8% |
| Air France | 74 | 1.5% |
| Deutsche Lufthansa | 63 | 1.3% |
| United Air Lines | 44 | 0.9% |
| China National Aviation Corporation | 43 | 0.9% |
| Military - U.S. Army Air Forces | 43 | 0.9% |
| Pan American World Airways | 41 | 0.8% |
| American Airlines | 37 | 0.7% |
| US Aerial Mail Service | 35 | 0.7% |
| Other values (2257) | 4224 |
Words
| Value | Count | Frequency (%) |
| air | 1481 | 10.3% |
|  | 961 | 6.7% |
| airlines | 840 | 5.8% |
| military | 778 | 5.4% |
| force | 557 | 3.9% |
| airways | 453 | 3.1% |
| u.s | 302 | 2.1% |
| aeroflot | 265 | 1.8% |
| lines | 184 | 1.3% |
| royal | 152 | 1.1% |
| Other values (2079) | 8422 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 10212 | 10.8% |
| (space) | 9421 | 9.9% |
| r | 8849 | 9.3% |
| a | 7786 | 8.2% |
| e | 6780 | 7.2% |
| n | 5528 | 5.8% |
| A | 5083 | 5.4% |
| o | 4380 | 4.6% |
| l | 4079 | 4.3% |
| s | 4000 | 4.2% |
| Other values (77) | 28632 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 68181 | |
| Uppercase Letter | 15071 | 15.9% |
| Space Separator | 9422 | 9.9% |
| Dash Punctuation | 939 | 1.0% |
| Other Punctuation | 869 | 0.9% |
| Open Punctuation | 115 | 0.1% |
| Close Punctuation | 115 | 0.1% |
| Decimal Number | 30 | < 0.1% |
| Control | 8 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 10212 | |
| r | 8849 | |
| a | 7786 | |
| e | 6780 | |
| n | 5528 | |
| o | 4380 | |
| l | 4079 | 6.0% |
| s | 4000 | 5.9% |
| t | 3921 | 5.8% |
| c | 1996 | 2.9% |
| Other values (28) | 10650 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 5083 | |
| M | 1217 | 8.1% |
| S | 1138 | 7.6% |
| C | 910 | 6.0% |
| F | 901 | 6.0% |
| T | 679 | 4.5% |
| L | 661 | 4.4% |
| U | 534 | 3.5% |
| P | 513 | 3.4% |
| N | 496 | 3.3% |
| Other values (16) | 2939 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 5 | |
| 7 | 4 | |
| 4 | 4 | |
| 2 | 3 | |
| 5 | 3 | |
| 1 | 3 | |
| 8 | 2 | 6.7% |
| 6 | 2 | 6.7% |
| 9 | 2 | 6.7% |
| 3 | 2 | 6.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 718 | |
| / | 109 | 12.5% |
| ' | 25 | 2.9% |
| , | 10 | 1.2% |
| & | 6 | 0.7% |
| ? | 1 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 9421 | |
| (no-break space) | 1 | < 0.1% |
Control
| Value | Count | Frequency (%) |
| (control character) | 6 | |
| (control character) | 2 | 25.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 939 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 115 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 115 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 83252 | |
| Common | 11498 | 12.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 10212 | |
| r | 8849 | 10.6% |
| a | 7786 | 9.4% |
| e | 6780 | 8.1% |
| n | 5528 | 6.6% |
| A | 5083 | 6.1% |
| o | 4380 | 5.3% |
| l | 4079 | 4.9% |
| s | 4000 | 4.8% |
| t | 3921 | 4.7% |
| Other values (54) | 22634 |
Common
| Value | Count | Frequency (%) |
| (space) | 9421 | |
| - | 939 | 8.2% |
| . | 718 | 6.2% |
| ( | 115 | 1.0% |
| ) | 115 | 1.0% |
| / | 109 | 0.9% |
| ' | 25 | 0.2% |
| , | 10 | 0.1% |
| (control character) | 6 | 0.1% |
| & | 6 | 0.1% |
| Other values (13) | 34 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 94627 | |
| None | 123 | 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 10212 | 10.8% |
| (space) | 9421 | 10.0% |
| r | 8849 | 9.4% |
| a | 7786 | 8.2% |
| e | 6780 | 7.2% |
| n | 5528 | 5.8% |
| A | 5083 | 5.4% |
| o | 4380 | 4.6% |
| l | 4079 | 4.3% |
| s | 4000 | 4.2% |
| Other values (64) | 28509 |
None
| Value | Count | Frequency (%) |
| é | 102 | |
| á | 5 | 4.1% |
| Ã | 2 | 1.6% |
| Ã | 2 | 1.6% |
| ó | 2 | 1.6% |
| ç | 2 | 1.6% |
| ï | 2 | 1.6% |
| ã | 1 | 0.8% |
| ú | 1 | 0.8% |
| ê | 1 | 0.8% |
| Other values (3) | 3 | 2.4% |
ac_type
Categorical
| Distinct | 2468 |
|---|---|
| Distinct (%) | 49.4% |
| Missing | 13 |
| Missing (%) | 0.3% |
| Memory size | 39.2 KiB |
| Value | Count |
|---|---|
| Douglas DC-3 | 333 |
| de Havilland Canada DHC-6 Twin Otter 300 | 81 |
| Douglas C-47A | 70 |
| Douglas C-47 | 64 |
| Douglas DC-4 | 41 |
| Other values (2463) | 4406 |
Length
| Max length | 42 |
|---|---|
| Median length | 36 |
| Mean length | 18.541542 |
| Min length | 4 |
Characters and Unicode
| Total characters | 92615 |
|---|---|
| Distinct characters | 77 |
| Distinct categories | 12 |
| Distinct scripts | 2 |
| Distinct blocks | 3 |
Unique
| Unique | 1863 |
|---|---|
| Unique (%) | 37.3% |
Sample
| 1st row | Wright Flyer III |
|---|---|
| 2nd row | Wright Byplane |
| 3rd row | Dirigible |
| 4th row | Curtiss seaplane |
| 5th row | Zeppelin L-1 (airship) |
Common Values
| Value | Count | Frequency (%) |
| Douglas DC-3 | 333 | 6.6% |
| de Havilland Canada DHC-6 Twin Otter 300 | 81 | 1.6% |
| Douglas C-47A | 70 | 1.4% |
| Douglas C-47 | 64 | 1.3% |
| Douglas DC-4 | 41 | 0.8% |
| Antonov AN-26 | 35 | 0.7% |
| Yakovlev YAK-40 | 35 | 0.7% |
| Junkers JU-52/3m | 30 | 0.6% |
| De Havilland DH-4 | 27 | 0.5% |
| Douglas C-47B | 27 | 0.5% |
| Other values (2458) | 4252 |
Words
| Value | Count | Frequency (%) |
| douglas | 1130 | 8.3% |
| boeing | 418 | 3.1% |
| dc-3 | 387 | 2.8% |
| lockheed | 332 | 2.4% |
| de | 294 | 2.2% |
| havilland | 292 | 2.1% |
| antonov | 288 | 2.1% |
| canada | 159 | 1.2% |
| otter | 146 | 1.1% |
| ilyushin | 142 | 1.0% |
| Other values (2525) | 10025 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 8649 | 9.3% |
| - | 5180 | 5.6% |
| e | 4842 | 5.2% |
| o | 4638 | 5.0% |
| a | 4636 | 5.0% |
| n | 3856 | 4.2% |
| l | 3696 | 4.0% |
| i | 3486 | 3.8% |
| r | 3306 | 3.6% |
| C | 3034 | 3.3% |
| Other values (67) | 47292 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 46427 | |
| Uppercase Letter | 17900 | 19.3% |
| Decimal Number | 13808 | 14.9% |
| Space Separator | 8650 | 9.3% |
| Dash Punctuation | 5180 | 5.6% |
| Other Punctuation | 264 | 0.3% |
| Open Punctuation | 190 | 0.2% |
| Close Punctuation | 189 | 0.2% |
| Math Symbol | 3 | < 0.1% |
| Control | 2 | < 0.1% |
| Other values (2) | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 4842 | |
| o | 4638 | |
| a | 4636 | |
| n | 3856 | 8.3% |
| l | 3696 | 8.0% |
| i | 3486 | 7.5% |
| r | 3306 | 7.1% |
| s | 2917 | 6.3% |
| t | 2357 | 5.1% |
| u | 2217 | 4.8% |
| Other values (18) | 10476 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 3034 | |
| D | 2819 | |
| A | 1901 | |
| B | 1728 | |
| H | 1016 | 5.7% |
| L | 883 | 4.9% |
| F | 796 | 4.4% |
| S | 790 | 4.4% |
| I | 642 | 3.6% |
| T | 620 | 3.5% |
| Other values (16) | 3671 |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 2167 | |
| 0 | 2103 | |
| 1 | 2017 | |
| 3 | 1706 | |
| 4 | 1704 | |
| 7 | 1494 | |
| 6 | 875 | |
| 5 | 713 | 5.2% |
| 8 | 664 | 4.8% |
| 9 | 365 | 2.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 185 | |
| . | 76 | |
| , | 2 | 0.8% |
| & | 1 | 0.4% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 8649 | |
| (no-break space) | 1 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 5180 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 190 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 189 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 3 |
Control
| Value | Count | Frequency (%) |
| (control character) | 2 | |
Initial Punctuation
| Value | Count | Frequency (%) |
| ‘ | 1 |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 64327 | |
| Common | 28288 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 4842 | 7.5% |
| o | 4638 | 7.2% |
| a | 4636 | 7.2% |
| n | 3856 | 6.0% |
| l | 3696 | 5.7% |
| i | 3486 | 5.4% |
| r | 3306 | 5.1% |
| C | 3034 | 4.7% |
| s | 2917 | 4.5% |
| D | 2819 | 4.4% |
| Other values (44) | 27097 |
Common
| Value | Count | Frequency (%) |
| (space) | 8649 | |
| - | 5180 | |
| 2 | 2167 | 7.7% |
| 0 | 2103 | 7.4% |
| 1 | 2017 | 7.1% |
| 3 | 1706 | 6.0% |
| 4 | 1704 | 6.0% |
| 7 | 1494 | 5.3% |
| 6 | 875 | 3.1% |
| 5 | 713 | 2.5% |
| Other values (13) | 1680 | 5.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 92596 | |
| None | 17 | < 0.1% |
| Punctuation | 2 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 8649 | 9.3% |
| - | 5180 | 5.6% |
| e | 4842 | 5.2% |
| o | 4638 | 5.0% |
| a | 4636 | 5.0% |
| n | 3856 | 4.2% |
| l | 3696 | 4.0% |
| i | 3486 | 3.8% |
| r | 3306 | 3.6% |
| C | 3034 | 3.3% |
| Other values (62) | 47273 |
None
| Value | Count | Frequency (%) |
| é | 12 | |
| è | 4 | 23.5% |
| (no-break space) | 1 | 5.9% |
Punctuation
| Value | Count | Frequency (%) |
| ‘ | 1 | |
| ’ | 1 |
Registros
Categorical
HIGH CARDINALITY  MISSING  UNIFORM 
| Distinct | 4700 |
|---|---|
| Distinct (%) | 99.2% |
| Missing | 272 |
| Missing (%) | 5.4% |
| Memory size | 39.2 KiB |
| Value | Count |
|---|---|
| 49 | 3 |
| SU-AFK | 2 |
| 2 | 2 |
| 19 | 2 |
| CCCP-45012 | 2 |
| Other values (4695) | 4725 |
Length
| Max length | 15 |
|---|---|
| Median length | 6 |
| Mean length | 6.4940878 |
| Min length | 1 |
Characters and Unicode
| Total characters | 30756 |
|---|---|
| Distinct characters | 49 |
| Distinct categories | 8 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 4665 |
|---|---|
| Unique (%) | 98.5% |
Sample
| 1st row | SC1 |
|---|---|
| 2nd row | L-48 |
| 3rd row | 97 |
| 4th row | 61 |
| 5th row | 82 |
Common Values
| Value | Count | Frequency (%) |
| 49 | 3 | 0.1% |
| SU-AFK | 2 | < 0.1% |
| 2 | 2 | < 0.1% |
| 19 | 2 | < 0.1% |
| CCCP-45012 | 2 | < 0.1% |
| 101 | 2 | < 0.1% |
| G-ADUZ | 2 | < 0.1% |
| VH-ABB | 2 | < 0.1% |
| OK-MCT | 2 | < 0.1% |
| I-BAUS | 2 | < 0.1% |
| Other values (4690) | 4715 | |
| (Missing) | 272 | 5.4% |
Words
| Value | Count | Frequency (%) |
|  | 39 | 0.8% |
| hk | 4 | 0.1% |
| 49 | 3 | 0.1% |
| cccp | 2 | < 0.1% |
| 82 | 2 | < 0.1% |
| 53 | 2 | < 0.1% |
| cf-tcl | 2 | < 0.1% |
| 12406 | 2 | < 0.1% |
| f-bbdm | 2 | < 0.1% |
| 204 | 2 | < 0.1% |
| Other values (4732) | 4772 |
Most occurring characters
| Value | Count | Frequency (%) |
| - | 3497 | 11.4% |
| C | 2022 | 6.6% |
| A | 1711 | 5.6% |
| 1 | 1541 | 5.0% |
| N | 1432 | 4.7% |
| 2 | 1246 | 4.1% |
| P | 1193 | 3.9% |
| 4 | 1187 | 3.9% |
| 5 | 1132 | 3.7% |
| 0 | 1098 | 3.6% |
| Other values (39) | 14697 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 15946 | |
| Decimal Number | 11081 | |
| Dash Punctuation | 3497 | 11.4% |
| Other Punctuation | 119 | 0.4% |
| Space Separator | 90 | 0.3% |
| Control | 12 | < 0.1% |
| Lowercase Letter | 10 | < 0.1% |
| Math Symbol | 1 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 2022 | 12.7% |
| A | 1711 | 10.7% |
| N | 1432 | 9.0% |
| P | 1193 | 7.5% |
| B | 718 | 4.5% |
| F | 690 | 4.3% |
| H | 636 | 4.0% |
| T | 611 | 3.8% |
| E | 560 | 3.5% |
| G | 559 | 3.5% |
| Other values (16) | 5814 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1541 | |
| 2 | 1246 | |
| 4 | 1187 | |
| 5 | 1132 | |
| 0 | 1098 | |
| 3 | 1037 | |
| 6 | 1026 | |
| 7 | 1015 | |
| 8 | 912 | |
| 9 | 887 |
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 5 | |
| y | 1 | 10.0% |
| e | 1 | 10.0% |
| o | 1 | 10.0% |
| w | 1 | 10.0% |
| d | 1 | 10.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 114 | |
| ? | 5 | 4.2% |
Control
| Value | Count | Frequency (%) |
| (control character) | 10 | |
| (control character) | 2 | 16.7% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3497 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 90 | |
Math Symbol
| Value | Count | Frequency (%) |
| + | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 15956 | |
| Common | 14800 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| C | 2022 | 12.7% |
| A | 1711 | 10.7% |
| N | 1432 | 9.0% |
| P | 1193 | 7.5% |
| B | 718 | 4.5% |
| F | 690 | 4.3% |
| H | 636 | 4.0% |
| T | 611 | 3.8% |
| E | 560 | 3.5% |
| G | 559 | 3.5% |
| Other values (22) | 5824 |
Common
| Value | Count | Frequency (%) |
| - | 3497 | |
| 1 | 1541 | |
| 2 | 1246 | 8.4% |
| 4 | 1187 | 8.0% |
| 5 | 1132 | 7.6% |
| 0 | 1098 | 7.4% |
| 3 | 1037 | 7.0% |
| 6 | 1026 | 6.9% |
| 7 | 1015 | 6.9% |
| 8 | 912 | 6.2% |
| Other values (7) | 1109 | 7.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 30756 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| - | 3497 | 11.4% |
| C | 2022 | 6.6% |
| A | 1711 | 5.6% |
| 1 | 1541 | 5.0% |
| N | 1432 | 4.7% |
| 2 | 1246 | 4.1% |
| P | 1193 | 3.9% |
| 4 | 1187 | 3.9% |
| 5 | 1132 | 3.7% |
| 0 | 1098 | 3.6% |
| Other values (39) | 14697 |
Todos_abordo
Real number (ℝ)
| Distinct | 244 |
|---|---|
| Distinct (%) | 4.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 31.120807 |
| Minimum | 0 |
| Maximum | 644 |
| Zeros | 5 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 19.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 7 |
| median | 16 |
| Q3 | 34.25 |
| 95-th percentile | 117 |
| Maximum | 644 |
| Range | 644 |
| Interquartile range (IQR) | 27.25 |
Descriptive statistics
| Standard deviation | 45.402692 |
|---|---|
| Coefficient of variation (CV) | 1.4589176 |
| Kurtosis | 24.044967 |
| Mean | 31.120807 |
| Median Absolute Deviation (MAD) | 11 |
| Skewness | 3.9275976 |
| Sum | 155853 |
| Variance | 2061.4044 |
| Monotonicity | Not monotonic |
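Several of the descriptive statistics above are derived from others, so they can be cross-checked. A minimal sketch using the rounded values printed in the tables, so agreement is only expected up to rounding; the variable names are illustrative:

```python
# Values copied from the Todos_abordo tables.
n_observations = 5008
mean = 31.120807
std = 45.402692
q1, q3 = 7, 34.25

cv = std / mean                   # coefficient of variation
variance = std ** 2               # variance is the squared standard deviation
iqr = q3 - q1                     # interquartile range
total = mean * n_observations     # sum, since the column has no missing values

print(round(cv, 4), round(variance, 2), iqr, round(total))
```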
Common Values
| Value | Count | Frequency (%) |
| 3 | 280 | 5.6% |
| 2 | 246 | 4.9% |
| 4 | 202 | 4.0% |
| 5 | 190 | 3.8% |
| 10 | 179 | 3.6% |
| 6 | 174 | 3.5% |
| 7 | 164 | 3.3% |
| 1 | 139 | 2.8% |
| 9 | 130 | 2.6% |
| 11 | 128 | 2.6% |
| Other values (234) | 3176 |
Minimum 10 values
| Value | Count | Frequency (%) |
| 0 | 5 | 0.1% |
| 1 | 139 | |
| 2 | 246 | |
| 3 | 280 | |
| 4 | 202 | |
| 5 | 190 | |
| 6 | 174 | |
| 7 | 164 | |
| 8 | 119 | |
| 9 | 130 |
Maximum 10 values
| Value | Count | Frequency (%) |
| 644 | 1 | |
| 524 | 1 | |
| 517 | 1 | |
| 394 | 1 | |
| 393 | 1 | |
| 384 | 1 | |
| 356 | 1 | |
| 349 | 1 | |
| 346 | 1 | |
| 340 | 1 |
Pasajeros_a_bordo
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 234 |
|---|---|
| Distinct (%) | 4.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 26.882788 |
| Minimum | 0 |
| Maximum | 614 |
| Zeros | 869 |
| Zeros (%) | 17.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 19.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 3 |
| median | 13 |
| Q3 | 29 |
| 95-th percentile | 109.65 |
| Maximum | 614 |
| Range | 614 |
| Interquartile range (IQR) | 26 |
Descriptive statistics
| Standard deviation | 43.052562 |
|---|---|
| Coefficient of variation (CV) | 1.6014917 |
| Kurtosis | 25.436588 |
| Mean | 26.882788 |
| Median Absolute Deviation (MAD) | 12 |
| Skewness | 4.0258255 |
| Sum | 134629 |
| Variance | 1853.5231 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
| 0 | 869 | 17.4% |
| 27 | 262 | 5.2% |
| 4 | 170 | 3.4% |
| 2 | 162 | 3.2% |
| 5 | 140 | 2.8% |
| 3 | 130 | 2.6% |
| 7 | 130 | 2.6% |
| 10 | 128 | 2.6% |
| 9 | 128 | 2.6% |
| 8 | 126 | 2.5% |
| Other values (224) | 2763 |
Minimum 10 values
| Value | Count | Frequency (%) |
| 0 | 869 | |
| 1 | 120 | 2.4% |
| 2 | 162 | 3.2% |
| 3 | 130 | 2.6% |
| 4 | 170 | 3.4% |
| 5 | 140 | 2.8% |
| 6 | 109 | 2.2% |
| 7 | 130 | 2.6% |
| 8 | 126 | 2.5% |
| 9 | 128 | 2.6% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 614 | 1 | |
| 509 | 1 | |
| 503 | 1 | |
| 381 | 1 | |
| 374 | 1 | |
| 364 | 1 | |
| 338 | 1 | |
| 335 | 1 | |
| 327 | 1 | |
| 316 | 1 |
Tripulacion_abordo
Real number (ℝ)
| Distinct | 34 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.4968051 |
| Minimum | 0 |
| Maximum | 83 |
| Zeros | 7 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 19.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 4 |
| Q3 | 6 |
| 95-th percentile | 11 |
| Maximum | 83 |
| Range | 83 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 3.6765019 |
|---|---|
| Coefficient of variation (CV) | 0.81758089 |
| Kurtosis | 65.889354 |
| Mean | 4.4968051 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 5.085092 |
| Sum | 22520 |
| Variance | 13.516666 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
| 3 | 954 | |
| 4 | 913 | |
| 2 | 828 | |
| 1 | 535 | |
| 5 | 514 | |
| 6 | 375 | 7.5% |
| 7 | 244 | 4.9% |
| 8 | 173 | 3.5% |
| 9 | 115 | 2.3% |
| 10 | 94 | 1.9% |
| Other values (24) | 263 | 5.3% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 0 | 7 | 0.1% |
| 1 | 535 | |
| 2 | 828 | |
| 3 | 954 | |
| 4 | 913 | |
| 5 | 514 | |
| 6 | 375 | 7.5% |
| 7 | 244 | 4.9% |
| 8 | 173 | 3.5% |
| 9 | 115 | 2.3% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 83 | 1 | < 0.1% |
| 61 | 1 | < 0.1% |
| 49 | 1 | < 0.1% |
| 43 | 1 | < 0.1% |
| 41 | 1 | < 0.1% |
| 33 | 1 | < 0.1% |
| 31 | 1 | < 0.1% |
| 30 | 1 | < 0.1% |
| 27 | 1 | < 0.1% |
| 25 | 4 |
cantidad de fallecidos
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 199 |
|---|---|
| Distinct (%) | 4.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 22.29353 |
| Minimum | 0 |
| Maximum | 583 |
| Zeros | 76 |
| Zeros (%) | 1.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 19.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 4 |
| median | 11 |
| Q3 | 25 |
| 95-th percentile | 85 |
| Maximum | 583 |
| Range | 583 |
| Interquartile range (IQR) | 21 |
Descriptive statistics
| Standard deviation | 34.972415 |
|---|---|
| Coefficient of variation (CV) | 1.5687248 |
| Kurtosis | 36.920237 |
| Mean | 22.29353 |
| Median Absolute Deviation (MAD) | 9 |
| Skewness | 4.6259543 |
| Sum | 111646 |
| Variance | 1223.0698 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
| 1 | 384 | 7.7% |
| 2 | 377 | 7.5% |
| 3 | 363 | 7.2% |
| 4 | 242 | 4.8% |
| 5 | 235 | 4.7% |
| 6 | 176 | 3.5% |
| 7 | 160 | 3.2% |
| 10 | 159 | 3.2% |
| 13 | 132 | 2.6% |
| 9 | 128 | 2.6% |
| Other values (189) | 2652 |
Minimum 10 values
| Value | Count | Frequency (%) |
| 0 | 76 | 1.5% |
| 1 | 384 | |
| 2 | 377 | |
| 3 | 363 | |
| 4 | 242 | |
| 5 | 235 | |
| 6 | 176 | |
| 7 | 160 | |
| 8 | 128 | 2.6% |
| 9 | 128 | 2.6% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 583 | 1 | |
| 520 | 1 | |
| 349 | 1 | |
| 346 | 1 | |
| 329 | 1 | |
| 301 | 1 | |
| 298 | 1 | |
| 290 | 1 | |
| 275 | 1 | |
| 271 | 1 |
Pasajeros_fallecidos
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 190 |
|---|---|
| Distinct (%) | 3.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 18.896565 |
| Minimum | 0 |
| Maximum | 560 |
| Zeros | 1040 |
| Zeros (%) | 20.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 19.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 8 |
| Q3 | 20 |
| 95-th percentile | 79 |
| Maximum | 560 |
| Range | 560 |
| Interquartile range (IQR) | 19 |
Descriptive statistics
| Standard deviation | 33.256766 |
|---|---|
| Coefficient of variation (CV) | 1.759937 |
| Kurtosis | 38.93841 |
| Mean | 18.896565 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 4.7628844 |
| Sum | 94634 |
| Variance | 1106.0125 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
| 0 | 1040 | |
| 1 | 308 | 6.2% |
| 18 | 304 | 6.1% |
| 2 | 263 | 5.3% |
| 3 | 193 | 3.9% |
| 4 | 185 | 3.7% |
| 5 | 139 | 2.8% |
| 6 | 133 | 2.7% |
| 8 | 126 | 2.5% |
| 7 | 126 | 2.5% |
| Other values (180) | 2191 |
Minimum 10 values
| Value | Count | Frequency (%) |
| 0 | 1040 | |
| 1 | 308 | 6.2% |
| 2 | 263 | 5.3% |
| 3 | 193 | 3.9% |
| 4 | 185 | 3.7% |
| 5 | 139 | 2.8% |
| 6 | 133 | 2.7% |
| 7 | 126 | 2.5% |
| 8 | 126 | 2.5% |
| 9 | 118 | 2.4% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 560 | 1 | |
| 505 | 1 | |
| 335 | 1 | |
| 316 | 1 | |
| 307 | 1 | |
| 287 | 1 | |
| 283 | 1 | |
| 278 | 1 | |
| 258 | 1 | |
| 257 | 1 |
Tripulacionfallecida
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 28 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.5597045 |
| Minimum | 0 |
| Maximum | 43 |
| Zeros | 400 |
| Zeros (%) | 8.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 19.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 2 |
| median | 3 |
| Q3 | 5 |
| 95-th percentile | 9 |
| Maximum | 43 |
| Range | 43 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 3.1043418 |
|---|---|
| Coefficient of variation (CV) | 0.87207852 |
| Kurtosis | 13.683758 |
| Mean | 3.5597045 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 2.5794338 |
| Sum | 17827 |
| Variance | 9.636938 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
| 3 | 1059 | |
| 2 | 892 | |
| 1 | 771 | |
| 4 | 591 | |
| 5 | 402 | 8.0% |
| 0 | 400 | 8.0% |
| 6 | 273 | 5.5% |
| 7 | 171 | 3.4% |
| 8 | 130 | 2.6% |
| 9 | 87 | 1.7% |
| Other values (18) | 232 | 4.6% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 0 | 400 | 8.0% |
| 1 | 771 | |
| 2 | 892 | |
| 3 | 1059 | |
| 4 | 591 | |
| 5 | 402 | 8.0% |
| 6 | 273 | 5.5% |
| 7 | 171 | 3.4% |
| 8 | 130 | 2.6% |
| 9 | 87 | 1.7% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 43 | 1 | < 0.1% |
| 33 | 1 | < 0.1% |
| 27 | 1 | < 0.1% |
| 25 | 2 | < 0.1% |
| 23 | 6 | |
| 22 | 5 | |
| 21 | 2 | < 0.1% |
| 20 | 3 | |
| 19 | 5 | |
| 18 | 3 |
suelo
Real number (ℝ)
SKEWED  ZEROS 
| Distinct | 51 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.7208466 |
| Minimum | 0 |
| Maximum | 2750 |
| Zeros | 4716 |
| Zeros (%) | 94.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 19.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 2750 |
| Range | 2750 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 55.251174 |
|---|---|
| Coefficient of variation (CV) | 32.106971 |
| Kurtosis | 2445.2883 |
| Mean | 1.7208466 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 49.203131 |
| Sum | 8618 |
| Variance | 3052.6922 |
| Monotonicity | Not monotonic |
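The extreme skewness and kurtosis follow from a column that is almost entirely zero with a few very large values. As a rough illustration, the sketch below uses only figures printed in this report (including the two rows that hold the maximum of 2750) to show how much of the total those two rows account for; the variable names are illustrative.

```python
# Values copied from the suelo tables.
n_obs = 5008        # observations
zeros = 4716        # rows where suelo == 0
total = 8618        # Sum
max_value = 2750    # Maximum
max_count = 2       # rows holding the maximum

zero_share = zeros / n_obs                      # fraction of rows equal to zero
top_share = max_value * max_count / total       # fraction of the sum from the maximum rows
mean_all = total / n_obs                        # mean over all rows
mean_without_max = (total - max_value * max_count) / (n_obs - max_count)

print(f"zeros: {zero_share:.1%}, share of sum from max rows: {top_share:.1%}")
print(f"mean: {mean_all:.4f}, mean without max rows: {mean_without_max:.4f}")
```

Because a handful of rows dominate the mean this way, the median and IQR (both 0) are the more robust summaries for this column.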
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 4716 | 94.2% |
| 2 | 78 | 1.6% |
| 1 | 63 | 1.3% |
| 3 | 21 | 0.4% |
| 4 | 16 | 0.3% |
| 5 | 12 | 0.2% |
| 7 | 10 | 0.2% |
| 8 | 9 | 0.2% |
| 10 | 6 | 0.1% |
| 6 | 6 | 0.1% |
| Other values (41) | 71 | 1.4% |
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 4716 | 94.2% |
| 1 | 63 | 1.3% |
| 2 | 78 | 1.6% |
| 3 | 21 | 0.4% |
| 4 | 16 | 0.3% |
| 5 | 12 | 0.2% |
| 6 | 6 | 0.1% |
| 7 | 10 | 0.2% |
| 8 | 9 | 0.2% |
| 9 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
|---|---|---|
| 2750 | 2 | < 0.1% |
| 225 | 1 | < 0.1% |
| 125 | 2 | < 0.1% |
| 113 | 1 | < 0.1% |
| 87 | 1 | < 0.1% |
| 85 | 1 | < 0.1% |
| 78 | 1 | < 0.1% |
| 71 | 1 | < 0.1% |
| 63 | 1 | < 0.1% |
| 58 | 1 | < 0.1% |
Resumen
Categorical
HIGH CARDINALITY  MISSING  UNIFORM 
| Distinct | 4857 |
|---|---|
| Distinct (%) | 98.1% |
| Missing | 59 |
| Missing (%) | 1.2% |
| Memory size | 39.2 KiB |
| Crashed under unknown circumstances. | 9 |
|---|---|
| Crashed while en route. | 8 |
| Crashed while attempting to land. | 7 |
| Crashed during takeoff. | 6 |
| Crashed into the sea. | 5 |
| Other values (4852) | 4914 |
Length
| Max length | 2669 |
|---|---|
| Median length | 787 |
| Mean length | 223.39382 |
| Min length | 8 |
Characters and Unicode
| Total characters | 1105576 |
|---|---|
| Distinct characters | 101 |
| Distinct categories | 14 |
| Distinct scripts | 2 |
| Distinct blocks | 3 |
Unique
| Unique | 4813 |
|---|---|
| Unique (%) | 97.3% |
Sample
| 1st row | During a demonstration flight, a U.S. Army flyer flown by Orville Wright nose-dived into the ground from a height of approximately 75 feet, killing Lt. Thomas E. Selfridge, 26, who was a passenger. This was the first recorded airplane fatality in history. One of two propellers separated in flight, tearing loose the wires bracing the rudder and causing the loss of control of the aircraft. Orville Wright suffered broken ribs, pelvis and a leg. Selfridge suffered a crushed skull and died a short time later. |
|---|---|
| 2nd row | Eugene Lefebvre was the first pilot to ever be killed in an air accident, after his controls jambed while flying in an air show. |
| 3rd row | First U.S. dirigible Akron exploded just offshore at an altitude of 1,000 ft. during a test flight. |
| 4th row | The first fatal airplane accident in Canada occurred when American barnstormer, John M. Bryant, California aviator was killed. |
| 5th row | The airship flew into a thunderstorm and encountered a severe downdraft crashing 20 miles north of Helgoland Island into the sea. The ship broke in two and the control car immediately sank drowning its occupants. |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| Crashed under unknown circumstances. | 9 | 0.2% |
| Crashed while en route. | 8 | 0.2% |
| Crashed while attempting to land. | 7 | 0.1% |
| Crashed during takeoff. | 6 | 0.1% |
| Crashed into the sea. | 5 | 0.1% |
| Crashed shortly after taking off. | 5 | 0.1% |
| Crashed on takeoff. | 4 | 0.1% |
| Shot down by rebel forces. | 4 | 0.1% |
| Crashed under unknown circumstances | 4 | 0.1% |
| Crashed en route. | 4 | 0.1% |
| Other values (4847) | 4893 | 97.7% |
| (Missing) | 59 | 1.2% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| the | 18463 | 10.1% |
| of | 5544 | 3.0% |
| a | 5456 | 3.0% |
| and | 5444 | 3.0% |
| to | 5429 | 3.0% |
| in | 3682 | 2.0% |
| crashed | 3386 | 1.8% |
| was | 2779 | 1.5% |
| aircraft | 2557 | 1.4% |
| into | 2360 | 1.3% |
| Other values (11568) | 127976 | 69.9% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| (space) | 179362 | 16.2% |
| e | 104905 | 9.5% |
| t | 81905 | 7.4% |
| a | 79924 | 7.2% |
| n | 68116 | 6.2% |
| i | 65870 | 6.0% |
| r | 63437 | 5.7% |
| o | 62600 | 5.7% |
| h | 42794 | 3.9% |
| s | 39810 | 3.6% |
| Other values (91) | 316853 | 28.7% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Lowercase Letter | 869373 | 78.6% |
| Space Separator | 179369 | 16.2% |
| Uppercase Letter | 25294 | 2.3% |
| Other Punctuation | 20624 | 1.9% |
| Decimal Number | 8853 | 0.8% |
| Dash Punctuation | 1645 | 0.1% |
| Close Punctuation | 158 | < 0.1% |
| Open Punctuation | 140 | < 0.1% |
| Final Punctuation | 67 | < 0.1% |
| Control | 33 | < 0.1% |
| Other values (4) | 20 | < 0.1% |
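The categories in this table are Unicode general categories. As a minimal sketch of how such a tally can be produced, the code below counts categories with Python's standard `unicodedata` module; the helper name `category_counts` and the two sample strings are illustrative, and this is not necessarily how the profiling tool computes its figures. Note that `unicodedata.category` returns two-letter codes (`Ll`, `Zs`, `Po`, ...) rather than the long names shown in the report.

```python
import unicodedata
from collections import Counter


def category_counts(texts):
    """Count Unicode general categories over every character in the texts."""
    counts = Counter()
    for text in texts:
        for ch in text:
            counts[unicodedata.category(ch)] += 1
    return counts


# Two short illustrative summaries (not the full Resumen column).
samples = ["Crashed en route.", "Shot down (1915)."]
counts = category_counts(samples)

for code, count in counts.most_common():
    print(code, count)

# A no-break space is a Space Separator, just like an ordinary space.
print(unicodedata.category("\u00a0"))
```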
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
|---|---|---|
| e | 104905 | 12.1% |
| t | 81905 | 9.4% |
| a | 79924 | 9.2% |
| n | 68116 | 7.8% |
| i | 65870 | 7.6% |
| r | 63437 | 7.3% |
| o | 62600 | 7.2% |
| h | 42794 | 4.9% |
| s | 39810 | 4.6% |
| d | 38411 | 4.4% |
| Other values (30) | 221601 | 25.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
|---|---|---|
| T | 5796 | 22.9% |
| C | 2775 | 11.0% |
| A | 2579 | 10.2% |
| S | 1531 | 6.1% |
| F | 1286 | 5.1% |
| M | 1207 | 4.8% |
| I | 1063 | 4.2% |
| P | 960 | 3.8% |
| W | 924 | 3.7% |
| N | 861 | 3.4% |
| Other values (16) | 6312 | 25.0% |
Other Punctuation
| Value | Count | Frequency (%) |
|---|---|---|
| . | 13487 | 65.4% |
| , | 5721 | 27.7% |
| ' | 771 | 3.7% |
| " | 362 | 1.8% |
| / | 170 | 0.8% |
| : | 56 | 0.3% |
| ; | 34 | 0.2% |
| & | 17 | 0.1% |
| % | 3 | < 0.1% |
| # | 2 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 2668 | 30.1% |
| 1 | 1368 | 15.5% |
| 2 | 1042 | 11.8% |
| 5 | 830 | 9.4% |
| 3 | 820 | 9.3% |
| 4 | 578 | 6.5% |
| 6 | 432 | 4.9% |
| 7 | 416 | 4.7% |
| 8 | 386 | 4.4% |
| 9 | 313 | 3.5% |
Space Separator
| Value | Count | Frequency (%) |
|---|---|---|
| (space) | 179362 | > 99.9% |
| (no-break space, U+00A0) | 7 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
|---|---|---|
| ) | 157 | 99.4% |
| ] | 1 | 0.6% |
Open Punctuation
| Value | Count | Frequency (%) |
|---|---|---|
| ( | 139 | 99.3% |
| [ | 1 | 0.7% |
Control
| Value | Count | Frequency (%) |
|---|---|---|
| (non-printing) | 32 | 97.0% |
| (non-printing) | 1 | 3.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
|---|---|---|
| - | 1645 | 100.0% |
Final Punctuation
| Value | Count | Frequency (%) |
|---|---|---|
| ’ | 67 | 100.0% |
Math Symbol
| Value | Count | Frequency (%) |
|---|---|---|
| + | 7 | 100.0% |
Currency Symbol
| Value | Count | Frequency (%) |
|---|---|---|
| $ | 7 | 100.0% |
Other Symbol
| Value | Count | Frequency (%) |
|---|---|---|
| ° | 3 | 100.0% |
Initial Punctuation
| Value | Count | Frequency (%) |
|---|---|---|
| ‘ | 3 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Latin | 894667 | 80.9% |
| Common | 210909 | 19.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
|---|---|---|
| e | 104905 | 11.7% |
| t | 81905 | 9.2% |
| a | 79924 | 8.9% |
| n | 68116 | 7.6% |
| i | 65870 | 7.4% |
| r | 63437 | 7.1% |
| o | 62600 | 7.0% |
| h | 42794 | 4.8% |
| s | 39810 | 4.4% |
| d | 38411 | 4.3% |
| Other values (56) | 246895 | 27.6% |
Common
| Value | Count | Frequency (%) |
|---|---|---|
| (space) | 179362 | 85.0% |
| . | 13487 | 6.4% |
| , | 5721 | 2.7% |
| 0 | 2668 | 1.3% |
| - | 1645 | 0.8% |
| 1 | 1368 | 0.6% |
| 2 | 1042 | 0.5% |
| 5 | 830 | 0.4% |
| 3 | 820 | 0.4% |
| ' | 771 | 0.4% |
| Other values (25) | 3195 | 1.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 1105434 | > 99.9% |
| None | 72 | < 0.1% |
| Punctuation | 70 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
|---|---|---|
| (space) | 179362 | 16.2% |
| e | 104905 | 9.5% |
| t | 81905 | 7.4% |
| a | 79924 | 7.2% |
| n | 68116 | 6.2% |
| i | 65870 | 6.0% |
| r | 63437 | 5.7% |
| o | 62600 | 5.7% |
| h | 42794 | 3.9% |
| s | 39810 | 3.6% |
| Other values (73) | 316711 | 28.7% |
Punctuation
| Value | Count | Frequency (%) |
|---|---|---|
| ’ | 67 | 95.7% |
| ‘ | 3 | 4.3% |
None
| Value | Count | Frequency (%) |
|---|---|---|
| é | 20 | 27.8% |
| á | 15 | 20.8% |
| Ã | 8 | 11.1% |
| Â | 7 | 9.7% |
| ó | 3 | 4.2% |
| ° | 3 | 4.2% |
| ö | 3 | 4.2% |
| ã | 2 | 2.8% |
| â | 2 | 2.8% |
| ü | 2 | 2.8% |
| Other values (6) | 7 | 9.7% |
Año_realializado
Real number (ℝ)
| Distinct | 111 |
|---|---|
| Distinct (%) | 2.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1970.8516 |
| Minimum | 1908 |
| Maximum | 2021 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | 1908 |
|---|---|
| 5-th percentile | 1931 |
| Q1 | 1951 |
| median | 1970 |
| Q3 | 1992 |
| 95-th percentile | 2010 |
| Maximum | 2021 |
| Range | 113 |
| Interquartile range (IQR) | 41 |
Descriptive statistics
| Standard deviation | 24.703696 |
|---|---|
| Coefficient of variation (CV) | 0.012534528 |
| Kurtosis | -0.95072008 |
| Mean | 1970.8516 |
| Median Absolute Deviation (MAD) | 20 |
| Skewness | -0.032020334 |
| Sum | 9870025 |
| Variance | 610.27257 |
| Monotonicity | Increasing |
| Value | Count | Frequency (%) |
|---|---|---|
| 1946 | 88 | 1.8% |
| 1989 | 83 | 1.7% |
| 1947 | 82 | 1.6% |
| 1948 | 78 | 1.6% |
| 1962 | 78 | 1.6% |
| 1972 | 77 | 1.5% |
| 1945 | 75 | 1.5% |
| 1951 | 75 | 1.5% |
| 1994 | 74 | 1.5% |
| 1970 | 73 | 1.5% |
| Other values (101) | 4225 | 84.4% |
| Value | Count | Frequency (%) |
|---|---|---|
| 1908 | 1 | < 0.1% |
| 1909 | 1 | < 0.1% |
| 1912 | 1 | < 0.1% |
| 1913 | 3 | 0.1% |
| 1915 | 2 | < 0.1% |
| 1916 | 5 | 0.1% |
| 1917 | 7 | 0.1% |
| 1918 | 4 | 0.1% |
| 1919 | 9 | 0.2% |
| 1920 | 18 | 0.4% |
| Value | Count | Frequency (%) |
|---|---|---|
| 2021 | 7 | 0.1% |
| 2020 | 8 | 0.2% |
| 2019 | 13 | 0.3% |
| 2018 | 19 | 0.4% |
| 2017 | 15 | 0.3% |
| 2016 | 23 | 0.5% |
| 2015 | 18 | 0.4% |
| 2014 | 23 | 0.5% |
| 2013 | 25 | 0.5% |
| 2012 | 26 | 0.5% |
| | Unnamed: 0 | Todos_abordo | Pasajeros_a_bordo | Tripulacion_abordo | cantidad de fallecidos | Pasajeros_fallecidos | Tripulacionfallecida | suelo | Año_realializado |
|---|---|---|---|---|---|---|---|---|---|
| Unnamed: 0 | 1.000 | 0.170 | 0.153 | 0.107 | 0.110 | 0.098 | 0.040 | 0.027 | 1.000 |
| Todos_abordo | 0.170 | 1.000 | 0.949 | 0.659 | 0.743 | 0.773 | 0.367 | 0.033 | 0.170 |
| Pasajeros_a_bordo | 0.153 | 0.949 | 1.000 | 0.512 | 0.697 | 0.819 | 0.238 | 0.016 | 0.152 |
| Tripulacion_abordo | 0.107 | 0.659 | 0.512 | 1.000 | 0.516 | 0.390 | 0.690 | 0.085 | 0.107 |
| cantidad de fallecidos | 0.110 | 0.743 | 0.697 | 0.516 | 1.000 | 0.926 | 0.669 | -0.007 | 0.110 |
| Pasajeros_fallecidos | 0.098 | 0.773 | 0.819 | 0.390 | 0.926 | 1.000 | 0.464 | -0.024 | 0.098 |
| Tripulacionfallecida | 0.040 | 0.367 | 0.238 | 0.690 | 0.669 | 0.464 | 1.000 | 0.036 | 0.040 |
| suelo | 0.027 | 0.033 | 0.016 | 0.085 | -0.007 | -0.024 | 0.036 | 1.000 | 0.027 |
| Año_realializado | 1.000 | 0.170 | 0.152 | 0.107 | 0.110 | 0.098 | 0.040 | 0.027 | 1.000 |
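A pairwise matrix like the one above can be built by applying a correlation coefficient to every pair of numeric columns. This section does not state which coefficient the report used, so the Pearson formula in the sketch below is an assumption, and the helper names `pearson` and `correlation_matrix` are hypothetical. The inputs are three columns from the first ten sample rows of this report, so the resulting values will not match the full-data matrix.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (spread_x * spread_y)


def correlation_matrix(columns):
    """Return a nested dict holding the coefficient for every column pair."""
    names = list(columns)
    return {a: {b: pearson(columns[a], columns[b]) for b in names} for a in names}


# Three numeric columns from the first ten sample rows of the report.
columns = {
    "Todos_abordo": [2, 1, 5, 1, 20, 28, 41, 19, 20, 22],
    "Tripulacion_abordo": [1, 1, 5, 1, 4, 4, 41, 4, 4, 4],
    "cantidad de fallecidos": [1, 1, 5, 1, 14, 28, 17, 19, 20, 22],
}

matrix = correlation_matrix(columns)
for name, row in matrix.items():
    print(name, [f"{value:.3f}" for value in row.values()])
```

The matrix is symmetric with ones on the diagonal, which is why the report's first and last rows and columns mirror each other.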
| | Unnamed: 0 | fecha | Ruta | OperadOR | ac_type | Registros | Todos_abordo | Pasajeros_a_bordo | Tripulacion_abordo | cantidad de fallecidos | Pasajeros_fallecidos | Tripulacionfallecida | suelo | Resumen | Año_realializado |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 1908-09-17 | Fort Myer, Virginia | Military - U.S. Army | Wright Flyer III | NaN | 2 | 1 | 1 | 1 | 1 | 0 | 0 | During a demonstration flight, a U.S. Army flyer flown by Orville Wright nose-dived into the ground from a height of approximately 75 feet, killing Lt. Thomas E. Selfridge, 26, who was a passenger. This was the first recorded airplane fatality in history. One of two propellers separated in flight, tearing loose the wires bracing the rudder and causing the loss of control of the aircraft. Orville Wright suffered broken ribs, pelvis and a leg. Selfridge suffered a crushed skull and died a short time later. | 1908 |
| 1 | 1 | 1909-09-07 | Juvisy-sur-Orge, France | NaN | Wright Byplane | SC1 | 1 | 0 | 1 | 1 | 0 | 0 | 0 | Eugene Lefebvre was the first pilot to ever be killed in an air accident, after his controls jambed while flying in an air show. | 1909 |
| 2 | 2 | 1912-07-12 | Atlantic City, New Jersey | Military - U.S. Navy | Dirigible | NaN | 5 | 0 | 5 | 5 | 0 | 5 | 0 | First U.S. dirigible Akron exploded just offshore at an altitude of 1,000 ft. during a test flight. | 1912 |
| 3 | 3 | 1913-08-06 | Victoria, British Columbia, Canada | Private | Curtiss seaplane | NaN | 1 | 0 | 1 | 1 | 0 | 1 | 0 | The first fatal airplane accident in Canada occurred when American barnstormer, John M. Bryant, California aviator was killed. | 1913 |
| 4 | 4 | 1913-09-09 | Over the North Sea | Military - German Navy | Zeppelin L-1 (airship) | NaN | 20 | 27 | 4 | 14 | 18 | 3 | 0 | The airship flew into a thunderstorm and encountered a severe downdraft crashing 20 miles north of Helgoland Island into the sea. The ship broke in two and the control car immediately sank drowning its occupants. | 1913 |
| 5 | 5 | 1913-10-17 | Near Johannisthal, Germany | Military - German Navy | Zeppelin L-2 (airship) | NaN | 28 | 27 | 4 | 28 | 18 | 3 | 0 | Hydrogen gas which was being vented was sucked into the forward engine and ignited causing the airship to explode and burn at 3,000 ft..German Navy's Zeppelin airships L-4 and L-5 were blown out to sea in February 1915, never to be seen again. | 1913 |
| 6 | 6 | 1915-03-05 | Tienen, Belgium | Military - German Navy | Zeppelin L-8 (airship) | NaN | 41 | 0 | 41 | 17 | 0 | 17 | 0 | Crashed into trees while attempting to land after being shot down by British and French aircraft. | 1915 |
| 7 | 7 | 1915-09-03 | Off Cuxhaven, Germany | Military - German Navy | Zeppelin L-10 (airship) | NaN | 19 | 27 | 4 | 19 | 18 | 3 | 0 | Exploded and burned near Neuwerk Island, when hydrogen gas, being vented, was ignited by lightning. | 1915 |
| 8 | 8 | 1916-07-28 | Near Jambol, Bulgeria | Military - German Army | Schutte-Lanz S-L-10 (airship) | NaN | 20 | 27 | 4 | 20 | 18 | 3 | 0 | Crashed near the Black Sea, cause unknown. | 1916 |
| 9 | 9 | 1916-09-24 | Billericay, England | Military - German Navy | Zeppelin L-32 (airship) | NaN | 22 | 27 | 4 | 22 | 18 | 3 | 0 | Shot down by British aircraft crashing in flames. | 1916 |
| | Unnamed: 0 | fecha | Ruta | OperadOR | ac_type | Registros | Todos_abordo | Pasajeros_a_bordo | Tripulacion_abordo | cantidad de fallecidos | Pasajeros_fallecidos | Tripulacionfallecida | suelo | Resumen | Año_realializado |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 4998 | 4998 | 2020-08-07 | Calicut, India | Air India Exppress | Boeing 737-8HG | VT-AXH | 190 | 184 | 6 | 20 | 18 | 2 | 0 | The flight IX344 suffered a runway excursion while landing at Kozhikode-Calicut Airport in heavy rain. The nose section separated from the fuselage after going down a steep slope at the end of the runway. The pilot and copilot were among the dead. Low visibility, wet runway, low cloud base and poor braking action possibly contributed to the accident. | 2020 |
| 4999 | 4999 | 2020-08-22 | Juba, South Sudan | South West Aviaiton | Antonov 26B | EX-126 | 8 | 5 | 3 | 7 | 4 | 3 | 0 | The cargo plane lost height shortly after departure from Juba Airport and impacted a farm near Hai Referendum about 3nm southwest of the airport. One passenger survived in critical condition. The plane was chartered by the World Food Program to transport supplies and wages to Wau and Aweil. | 2020 |
| 5000 | 5000 | 2020-09-25 | Near Chuguev, Ukraine | Military - Ukraine Air Force | Antonov An26SH | 76 yellow | 27 | 20 | 7 | 26 | 19 | 7 | 0 | The military transport, crashed 1.2 miles from Chuguev air base. The plane was carrying cadets from a nearby air force university on a training flight. The crew may have reported failure of an engine prior to the accident. | 2020 |
| 5001 | 5001 | 2021-01-09 | Near Jakarta, Indonesia | Sriwijaya Air | Boeing 737-524 | PK-CLC | 62 | 56 | 6 | 62 | 56 | 6 | 0 | Sriwijaya Air flight 182 was climbing through 10,900 ft., 11 nm north of Jakarta-Soekarno-Hatta International Airport, over the Java Sea when radar and radio contact was lost. The aircraft then lost height rapidly and impacted the Java Sea. Debris was located near Lancang Island. | 2021 |
| 5002 | 5002 | 2021-03-02 | Pieri, Sudan | South Sudan Supreme Airlines | Let L-410UVP-E | HK-4274 | 10 | 8 | 2 | 10 | 8 | 2 | 0 | One of the engines on the aircraft failed 10 minutes after takeof. When the plane turned back, the second engine failed. | 2021 |
| 5003 | 5003 | 2021-03-28 | Near Butte, Alaska | Soloy Helicopters | Eurocopter AS350B3Â Ecureuil | N351SH | 6 | 5 | 1 | 5 | 4 | 1 | 0 | The sightseeing helicopter crashed after missing the top of a 6,000 ft mountain by just 10 - 15 ft. The crash site was near Knik glacier. The pilot, and four others were killed including Czech billionaire Petr Kellner. | 2021 |
| 5004 | 5004 | 2021-05-21 | Near Kaduna, Nigeria | Military - Nigerian Air Force | Beechcraft B300 King Air 350i | NAF203 | 11 | 7 | 4 | 11 | 7 | 4 | 0 | While on final approach, in poor weather conditions, the aircraft crashed and burst into flames less than 10 km from Kaduna Airport. All 11 occupants were killed, incuding General Ibrahim Attahiru, Chief of Staff of the Nigerian Army. | 2021 |
| 5005 | 5005 | 2021-06-10 | Near Pyin Oo Lwin, Myanmar | Military - Myanmar Air Force | Beechcraft 1900D | 4610 | 14 | 12 | 2 | 12 | 11 | 1 | 0 | The plane was carrying military personnel and monks when it crashed about 300 meters from a steel plant in the Mandalay region. The plane was attempting to land in poor weather conditions and broke into three pieces. | 2021 |
| 5006 | 5006 | 2021-07-04 | Patikul, Sulu, Philippines | Military - Philippine Air Force | Lockheed C-130H Hercules | 5125 | 96 | 88 | 8 | 50 | 18 | 3 | 3 | While attempting to land at Jolo Airport, the military transport overran the runway, struck two houses and burst into flames coming to rest on a coconut plantation. | 2021 |
| 5007 | 5007 | 2021-07-06 | Palana, Russia | Kamchatka Aviation Enterprise | Antonov An 26B-100 | RA-26085 | 28 | 22 | 6 | 28 | 22 | 6 | 0 | The passenger plane crashed into the top of a cliff while attempting to land in inclement weather. The debris fell into the sea. Contact was lost with the plane 10 minutes before it was to land. | 2021 |